The XOO7 Benchmark
نویسندگان
چکیده
As XML becomes the standard for electronic data interchange, benchmarks are needed to provide the comparative performance analysis of XML management systems (XMLMS). Typically a benchmark should adhere to four criteria: relevance, portability, scalability and simplicity [1]. The data structure of a benchmark for XML must be complex enough to capture the characteristics of XML data representation. Data sets should be in various sizes. Benchmark queries should only be defined with the primitives of the language. XML models present similarities with object-oriented data models. While XML is able to handle semi-structured data, it supports most of the features of complex object models. Classes, methods and inheritance are not defined in XML but classes can be expressed through element types and attributes. Thus in developing a benchmark for XML, we decided to use the well-established OO7 benchmark [3] designed for object-oriented database management system as a starting point. The XOO7 benchmark, an XML version of the OO7 benchmark, is a singleuser based benchmark for XMLMS that focuses on the query processing aspect of XML. The DTD and data set of XOO7 are directly obtained by mapping the OO7 schema and data set to XML [4]. OO7 does not model any specific application, but it intends to capture the characteristics of an object-oriented database. Additionally, in order to cater for the document centric view of XML, we extended the document object of OO7 to contain sub-elements mixed with text data. Thus, the Document element provides for a liberal use of free-form text that is "marked up" with elements. Therefore the XOO7 data set can capture all the characteristics of typical XML database applications. We provide a parameterized program to generate XML databases of various sizes and characteristics. The XOO7 benchmark also extends and modifies the eight OO7 queries with a set of twenty-three queries. XOO7 provides relational, document and navigational queries that are specific and critical for XML database applications. The queries test the primitive features and each query covers only a few features. XOO7 queries are defined to express the requirements published by the W3C XML Query Language working group. XOO7 queries are therefore supported by most XMLMS, which makes them very portable. Users can choose a subset of queries: data centric, document or navigational, to test on the features required in their applications [2, 5]. XOO7 has been implemented and used to evaluate two XML-enabled management systems: LORE, XENA and two native-XML management systems: Kweelt, DOM-XPath1. The experiment results and the analysis can be found in [5]. The up-to-date information about the XOO7 benchmark can be found at the web site: http://www.comp.nus.edu.sg/~ebh/XOO7.html.
منابع مشابه
The XOO7 XML Management System Benchmark
As XML becomes the standard for electronic data interchange, it is necessary to design benchmarks to provide for the comparative performance analysis of XML management systems (XMLMS). In this work, we propose XOO7, a benchmark for XMLMS. The XOO7 benchmark is an XML version of the OO7 benchmark [11] enriched with relational, document and navigational queries that are specific and critical for ...
متن کاملXOO7: Applying OO7 Benchmark to XML Query Processing Tools
If XML is to play the critical role of the lingua franca for Internet data interchange that many predict, it is necessary to start designing and adopting benchmarks allowing the comparative performance analy sis of the tools being developed and proposed. The effectiveness of existing XML query languages has been studied by many who focused on the comparison of linguistic features, implicitly re...
متن کاملBenchmarking XML Management Systems: The XOO7 Way
The effectiveness of existing XML query languages has been studied by many who focused on the comparison of linguistic features, implicitly reflecting the fact that most XML tools exist only on paper. In this paper, with a focus on efficiency and concreteness, we propose a pragmatic first step toward the systematic benchmarking of XML query processing platforms. We begin by identifying the nece...
متن کاملMulti-user Evaluation of XML Data Management Systems with XMach-1
XMach-1 was the first XML data management benchmark designed for general applicability [1]. It is still the only benchmark supporting a multiuser performance evaluation of XML database systems. After a brief review of XMach-1 we summarize three additionally proposed benchmarks (XMark, XOO7, Mbench) and provide a comparison between these benchmarks. We then present experiences and performance re...
متن کاملBenchmarking of Native XML Database Systems
In this thesis, XML database systems, standard database benchmark techniques, and XML database system benchmarks such as XBench, XMark, XMach-1, and XOO7 are reviewed. A new benchmark system called MyBench is provided for benchmarking of native XML database systems. First, graph grammars are utilised to define a group of productions with different tree structures, then the algorithm for generat...
متن کامل